Hierarchical Spatial Process Models for Multiple Traits in Large Genetic Trials.
Abstract
This article expands upon recent interest in Bayesian hierarchical models in quantitative genetics by developing spatial process models for inference on additive and dominance genetic variance within the context of large spatially referenced trial datasets of multiple traits of interest. Direct application of such multivariate models to large spatial datasets is often computationally infeasible because of the cubic-order matrix algorithms involved in estimation. The situation is even worse in Markov chain Monte Carlo (MCMC) contexts, where such computations are performed for several thousand iterations. Here, we discuss approaches that help obviate these hurdles without sacrificing the richness in modeling. For genetic effects, we demonstrate how an initial spectral decomposition of the relationship matrices avoids the expensive matrix inversions required in previously proposed MCMC methods. For spatial effects, we discuss a multivariate predictive process that reduces the computational burden by projecting the original process onto a subspace generated by realizations of the original process at a specified set of locations (or knots). We illustrate the proposed methods using a synthetic dataset with multivariate additive and dominance genetic effects and anisotropic spatial residuals, and a large dataset from a Scots pine (Pinus sylvestris L.) progeny study conducted in northern Sweden. Our approaches enable us to provide a comprehensive analysis of this large trial, which amply demonstrates that, in addition to violating basic assumptions of the linear model, ignoring spatial effects can result in downwardly biased measures of heritability.
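The two computational ideas named in the abstract can be illustrated with a small sketch. This is a single-trait simplification under stated assumptions, not the article's algorithm: the exponential covariance, the toy matrix standing in for a relationship matrix, and the names `exp_cov`, `predictive_process_cov`, and `spectral_solve` are all hypothetical choices made for illustration. The first part builds the low-rank covariance that a knot-based predictive process induces; the second shows how one up-front eigendecomposition of a fixed matrix A lets a linear system of the form (I/tau2 + A^{-1}/sigma2_a) x = b be solved repeatedly without a fresh inversion each time the variance parameters change.

```python
import numpy as np


def exp_cov(x, y, sigma2=1.0, phi=3.0):
    """Isotropic exponential covariance between two sets of 2-D locations."""
    dist = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=-1)
    return sigma2 * np.exp(-phi * dist)


def predictive_process_cov(locs, knots):
    """Low-rank covariance C(s, S*) C(S*, S*)^{-1} C(S*, s') induced by knots S*."""
    c_sk = exp_cov(locs, knots)   # n x m cross-covariance
    c_kk = exp_cov(knots, knots)  # m x m covariance among the knots
    return c_sk @ np.linalg.solve(c_kk, c_sk.T)


def spectral_solve(eigvecs, eigvals, sigma2_a, tau2, rhs):
    """Solve (I / tau2 + A^{-1} / sigma2_a) x = rhs, given A = P diag(eigvals) P^T."""
    # In the eigenbasis of A the system matrix is diagonal, so no inversion is needed.
    diag = 1.0 / tau2 + 1.0 / (sigma2_a * eigvals)
    return eigvecs @ ((eigvecs.T @ rhs) / diag)


rng = np.random.default_rng(0)

# Spatial part: 200 locations, with the first 25 reused as knots (an assumption).
locs = rng.uniform(size=(200, 2))
knots = locs[:25]
pp = predictive_process_cov(locs, knots)

# Genetic part: a toy symmetric positive-definite matrix stands in for A.
g = rng.normal(size=(50, 100))
A = g @ g.T / 100 + 0.1 * np.eye(50)
eigvals, eigvecs = np.linalg.eigh(A)  # computed once, before any sampling loop

rhs = rng.normal(size=50)
sigma2_a, tau2 = 0.7, 1.3  # illustrative variance values
x_spec = spectral_solve(eigvecs, eigvals, sigma2_a, tau2, rhs)

# Direct solve for comparison; this is the cubic-cost step the decomposition avoids.
x_direct = np.linalg.solve(np.eye(50) / tau2 + np.linalg.inv(A) / sigma2_a, rhs)
```

In this sketch the predictive-process covariance reproduces the full covariance exactly at the knots and only ever solves an m x m system, while `spectral_solve` costs two matrix-vector products per call once the eigendecomposition is available.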
Similar resources
Analysis of Hierarchical Bayesian Models for Large Space Time Data of the Housing Prices in Tehran
Housing prices are correlated with their locations across neighborhoods, so the data exhibit spatial correlation. Prices also vary from month to month, so they exhibit temporal correlation as well. Spatio-temporal models are used to analyze data of this type. An important purpose of reviewing this type of data is to fit a suitable model for the spatial-temporal an...
Comparison of Linear and Threshold Models for Estimation Genetic and Phenotypic Parameters of Success of Conception at First Service and Inseminations to Conception in Holstein Cattles in East Azarbayjan Province
In this research, genetic and phenotypic parameters for reproductive traits were estimated using linear and threshold models, with data from 6 large industrial dairy herds of East Azerbaijan province collected by the Agriculture Jihad Organization over 10 years (2001-2010). Best linear unbiased predictions of the traits' breeding values were estimated using the Restricted Maximum Likelihood method by WOMBAT sof...
Spatial Design for Knot Selection in Knot-Based Low-Rank Models
Analysis of large geostatistical data sets usually entails expensive matrix computations. This creates challenges for statistical inference with traditional Bayesian models. In addition, researchers often face multiple spatial data sets with complex spatial dependence structures that are difficult to analyze. This is a problem for MCMC sampling algorith...
Spatiotemporal Estimation of PM2.5 Concentration Using Remotely Sensed Data, Machine Learning, and Optimization Algorithms
PM2.5 (particles <2.5 μm in aerodynamic diameter) can be measured by ground stations in urban areas, but the number of these stations and their geographical coverage are limited. Therefore, these data are not adequate for calculating concentrations of PM2.5 over a large urban area. This study aims to use Aerosol Optical Depth (AOD) satellite images and meteorological data from 2014 to 2017 ...
Journal:
- Journal of the American Statistical Association
Volume 105, Issue 490
Publication date: 2010